HAMR: high-throughput annotation of modified ribonucleotides.

Authors

  • Paul Ryvkin
  • Yuk Yee Leung
  • Ian M Silverman
  • Micah Childress
  • Otto Valladares
  • Isabelle Dragomir
  • Brian D Gregory
  • Li-San Wang
Abstract

RNA is often altered post-transcriptionally by the covalent modification of particular nucleotides; these modifications are known to modulate the structure and activity of their host RNAs. The recent discovery that an RNA methyl-6 adenosine demethylase (FTO) is a risk gene in obesity has brought to light the significance of RNA modifications to human biology. These noncanonical nucleotides, when converted to cDNA in the course of RNA sequencing, can produce sequence patterns that are distinguishable from simple base-calling errors. To determine whether these modifications can be detected in RNA sequencing data, we developed a method that can not only locate these modifications transcriptome-wide with single nucleotide resolution, but can also differentiate between different classes of modifications. Using small RNA-seq data we were able to detect 92% of all known human tRNA modification sites that are predicted to affect RT activity. We also found that different modifications produce distinct patterns of cDNA sequence, allowing us to differentiate between two classes of adenosine and two classes of guanine modifications with 98% and 79% accuracy, respectively. To show the robustness of this method to sample preparation and sequencing methods, as well as to organismal diversity, we applied it to a publicly available yeast data set and achieved similar levels of accuracy. We also experimentally validated two novel and one known 3-methylcytosine (3mC) sites predicted by HAMR in human tRNAs. Researchers can now use our method to identify and characterize RNA modifications using only RNA-seq data, both retrospectively and when asking questions specifically about modified RNA.
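The idea that modified nucleotides leave mismatch patterns "distinguishable from simple base-calling errors" can be illustrated with a small sketch. The abstract does not state which statistical test is used, so everything below is an assumption made for illustration: a one-sided binomial test of the observed mismatch count against a fixed base-calling error rate (here 1%), with hypothetical function names and made-up read counts.

```python
from math import comb


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def mismatch_p_value(ref_base: str, counts: dict, error_rate: float = 0.01) -> float:
    """P-value for seeing at least this many non-reference reads at one site.

    counts maps each observed base to its read count at the site.
    error_rate is an assumed per-base sequencing error rate, not a value
    taken from the paper.
    """
    depth = sum(counts.values())
    mismatches = depth - counts.get(ref_base, 0)
    return binomial_tail(depth, mismatches, error_rate)


# Illustrative (invented) pileups at two reference-A positions.
candidate_site = {"A": 60, "T": 25, "G": 15}  # many mismatches
ordinary_site = {"A": 99, "C": 1}             # consistent with base-calling error

print(mismatch_p_value("A", candidate_site))
print(mismatch_p_value("A", ordinary_site))
```

Under these assumptions the first site yields a vanishingly small p-value and would be flagged as a candidate modification, while the second is compatible with ordinary sequencing error. Telling modification classes apart would additionally need the per-base breakdown of the mismatches, which this sketch does not attempt.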

Similar resources

Chemical Modifications Mark Alternatively Spliced and Uncapped Messenger RNAs in Arabidopsis.

Posttranscriptional chemical modification of RNA bases is a widespread and physiologically relevant regulator of RNA maturation, stability, and function. While modifications are best characterized in short, noncoding RNAs such as tRNAs, growing evidence indicates that mRNAs and long noncoding RNAs (lncRNAs) are likewise modified. Here, we apply our high-throughput annotation of modified ribonuc...

Revealing the Elusive Plant Epitranscriptome

RNA is decorated by various chemical modifications that affect the stability and localization of this fragile molecule. These diverse, highly conserved, covalent modifications also help RNA perform its crucial roles in the cell. Over 100 RNA modifications have been identified, primarily in noncoding RNAs (e.g., tRNA and rRNA). Less is known about the chemical modification of mRNA, although there ...

RNA secondary structure prediction using high-throughput SHAPE.

Understanding the function of RNA involved in biological processes requires a thorough knowledge of RNA structure. Toward this end, the methodology dubbed "high-throughput selective 2' hydroxyl acylation analyzed by primer extension", or SHAPE, allows prediction of RNA secondary structure with single nucleotide resolution. This approach utilizes chemical probing agents that preferentially acyla...

Annotation confidence score for genome annotation: a genome comparison approach

MOTIVATION The massively parallel sequencing technology can be used by small research labs to generate genome sequences of their research interest. However, annotation of genomes still relies on the manual process, which becomes a serious bottleneck to the high-throughput genome projects. Recently, automatic annotation methods are increasingly more accurate, but there are several issues. One im...

Journal:
  • RNA

Volume 19, Issue 12

Pages: -

Publication date: 2013